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Summary. — X-ray observations of clusters provide key information on the dark 
matter, on the formation of structures in the Universe, and can be used to constrain 
the cosmological parameters. I review our current knowledge, with emphasis on 
recent XMM and Chandra results. 



Introduction 



In the standard CDM (Cold Dark Matter) cosmological scenario, initial density fluctu- 
ations, generated in the early Universe, grow under the influence of gravity. The Universe 
gets more and more structured with time. At large scale, the matter density distribution 
exhibits a web-like topology with expanding voids surrounded by contracting sheets and 
filaments. Massive clusters of galaxies, located at the crossing of filaments, define the 
nodes of this cosmic web. Clusters of galaxies are the largest collapsed structures, with 
masses ranging from 10 13 Mq for small groups to 10 15 M© for the richest clusters. Be- 
cause of their size, their mass content reflects that of the Universe: ~ 85% of the mass 
is made of Dark Matter. The main baryonic cluster component is a hot X-ray emitting 
intracluster gas, as shown by the first X-ray images obtained with the Einstein satellite 
PQ. Only a few percent of the mass in clusters lies in the optical galaxies. 
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In the CDM scenario, the amplitude of the initial density fluctuations decreases with 
increasing scale. As a result, low- mass objects form first and then merge together to 
form more massive objects. Clusters of galaxies are the last manifestation of this hier- 
archical clustering. They started to form in the recent cosmological epoch (z ~ 2) and 
the cluster population is continuously evolving. Individual clusters are not immutable 
objects. A cluster continuously accretes matter and smaller groups along the filaments, 
and, occasionally, merges with another cluster of similar mass. 

Clusters of galaxies are key objects for cosmological studies (see the review 0). By 
studying the properties of clusters of galaxies we can test the scenario of structure for- 
mation and better understand the gravitational collapse of the Dark Matter and the 
baryon specific physics. Furthermore, because the history of structure formation depends 
strongly on the cosmology, studies of cluster samples can also constrain cosmological pa- 
rameters. In this course, I will focus on the X-ray observations relevant for these issues 
and on properties which can also been studied with SZ observations. In particular, I will 
not discuss abundance measurements, although they are important for our understand- 
ing of the history of galaxy and star formation and of the heating and enrichment of 
the intra-cluster medium. This course cannot be exhaustive. More information on X-ray 
clusters can be found in the reference book "X-ray emission from clusters of galaxies" 
[3] and in more recent books 0J|S] and reviews quoted in the text. 

2. Observing clusters in X-ray 

2T. X-ray emission of clusters. - The Intra-Cluster medium (ICM) is a hot, tenuous 
and optically thin plasma. The gas density varies from ~ 10~ 4 cm~ 3 in the outer regions 
of clusters to a few ~ 10 -2 cm~ 3 in the center. The ICM has mean temperatures in the 
range kT = 0.5 - 15keV, reflecting the depth of the potential well (kT oc GM/R). Note 
that the ICM is not an isothermal plasma, although temperature variations are usually 
small (see below). The ICM is enriched in heavy elements, with typical abundances of 
1/3 the solar value. At the temperature of the ICM, H and He are fully ionized. Most 
electrons come from these two elements. The electron density is nearly independent of 
the ionization state and is given by n c ~ 1.2nn, where «h is the hydrogen density. The 
ionization stage of the other elements depends on the temperature. 

The X-ray emission is that of a coronal plasma at ionization equilibrium for de- 
tailed description]. For a volume element dV of electronic density n e , temperature T, 
abundances [Z/H], the number of photons emitted by unit time in the energy range 
[E, E + dE) can be written as 

(1) dN(E) = n 2 c e(E, T, [Z/H])dEdV 

where e(E,T, [Z/H]) is the photon emissivity at energy E. The intensity scales as the 
square of the density, because all emission processes (like the Bremsstrahlung emission) 
result from collisions between electron and ions. 
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Examples of X-ray spectra for typical cluster temperatures are shown in Fig.^ Due to 
the high temperature, the continuum emission is dominated by thermal Bremsstrahlung, 
the main species by far contributing to the emission being H and He. The emissivity of 
this continuum is very sensitive to temperature for energies greater than kT and rather 
insensitive to it below. This is due to the exponential cut-off of the Bremsstrahlung emis- 
sion. Indeed, it scales as g(E,T)T~ 1 ^ 2 exp(—E/kT), where g(E,T), the Gaunt factor, is 
a gradual function of kT. The only line that clearly stands out at all temperatures is 
the Iron K line complex around 6.7 keV (see Fig.^l. We can also observe the K lines of 
other elements (Z > 8, H and He-like ionization states), as well as the L-shell complex 
of lower ionization states of Iron. However the intensity of these lines rapidly decreases 
with increasing temperature. Except for the cool clusters (kT <J 4keV) or in the cooling 
core present in some clusters, one cannot expect to measure the abundance of elements 
other than Iron because they are completely ionized. 

2'2. Extracting Physical information from X-ray observations. - From above, it is 
clear that X-ray observations give access to the two characteristics of the ICM, which are 
the density and the temperature. The shape of the spectrum determines the temperature 
( 1 ), whereas the normalization provides the emission measure EM = J n\dV . 

2'2.1. Gas temperature. Its determination requires spectroscopic data. The temper- 
ature is derived by fitting the observed spectrum (as in Fig. [SJ with a thermal emission 
model convolved with the instrument response (i.e., taking into account how the effective 
area and spectral resolution vary over the energy range). The model spectrum at Earth 
is computed from the emitted spectrum, taking into account the cluster redshift and the 



( ) and also the heavy element abundances from the line equivalent widths and possibly the 
redshift from the line positions 
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galactic absorption (which mostly affects the emission below 1 — 2 keV) . It is important 
to keep in mind that: 

• The temperature is constrained by the position of the exponential cut-off in the spec- 
trum. In order to have a proper determination of the temperature, we need spectroscopic 
instruments sensitive up to energies greater than kT, i.e., typically 10 keV. 

• The ICM is not strictly isothermal. This means that a temperature inferred from an 
isothermal fit to the data is actually a 'mean' value along the line of sight and in the 
considered cluster region. This temperature is not simply, as often thought, the 'emission 
weighted' temperature [S]. 

2 '2. 2. Gas density. The emissivity is not very sensitive to kT at low energy. Therefore 
X-ray images or surface brightness profile (as in Fig. extracted in a soft energy band 
(E <J 2 keV) are used to determine the gas density distribution. X-ray images reflect 
the ICM morphology. Note that one must not forget projection effects and that density 
contrast are enhanced because the X-ray emission increases as the square of the density. 
The emission measure along the line of sight at projected radius r, EM(r) = J n 2 dl, 
can be deduced from the surface brightness, S(9): 

(2) EM ^ = A "%% m Wlth r = dA ( Z )0 

where cIa(z) is the angular distance at the cluster redshift z. A(T, z) is the emissivity 
in the considered energy band, taking into account the absorption by our galaxy, the 
redshift, and the instrumental spectral response. Because A(T, z) depends only weakly 
on the temperature in the soft band, it is essentially insensitive to temperature gradients 
(except for cool systems, [7]) and one can generally use the average cluster temperature. 

The gas density radial profile n s (R) is usually derived from Eq. [21 assuming spher- 
ical symmetry. In that case EM(r) = f°°n^(R) RdR/\/R 2 — r 2 . One can use de- 
projection techniques or parametric models fitted to the data. A popular model is 



the so-called isothermal /3-model: n(R) = uq 1 + (R/R c ) 2 , which gives S(6) 

-3/3+1/2 



-3/3/2 



So 



i + {e/e c f 



This model fits reasonably well cluster profiles at large radii, 



but it underestimates the density in central cooling core of clusters (see also Sect. I4"4|) . 

2'2.3. Cluster properties. Depending on the quality of the data, various physical 
cluster properties can then be derived. The easiest to determine is the X-ray luminosity, 
which is computed from the observed flux in the instrument energy band ( 2 ) A higher 
statistical quality is required to measure a surface brightness profile or a spectrum. From 
these two quantities one derives the global temperature, the gas density radial profile and 
therefore the gas mass. If spatially resolved spectroscopic measurements are available, 



( 2 ) Conversion from count rates to bolometric luminosity depends on the temperature. The 
usual way is to use a Ly^-T relation. 
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Fig. 2. - The light pass in the two XMM telescopes equipped with the EPIC and RGS experi- 
ments. After collection by the mirror module (58 Wolter shells) about 40% of the light is focused 
to the EPIC CCD camera, while a similar amount is dispersed by a Reflection Grating Array. 
The diffracted beam is imaged by a CCD camera on the Rowland circle for high resolution 
spectroscopy. In the third telescope, all the light is collected by the EPIC CCD experiment. 
The Figure was reproduced from ESA SP-1097. 



one can also derive the temperature profile. This gives access to the entropy profile, key 
information on the history of the ICM (see Sec. 14 '40 . One can also deduce the total mass 
profile from the Hydrostatic Equilibrium Equation: 



(3) M(r) = -^ 



d In n d\nT 



d\nr d In 7 



where G and m p are the gravitational constant and proton mass and fj, ~ 0.61. Note that 
an approximation of the total mass can be obtained by simply assuming isothermality. 

2'3. Modern X-ray observatories . - X-rays are absorbed by the Earth's atmosphere. 
Therefore, X-ray observatories are put on board satellites. Two X-ray satellites are now 
in operation, XMM-Newton and Chandra. They are fully described in [HI El HOI ITT] and 
|12j . The general concept in the same. The X-ray photons are collected and focused by 
grazing incidence telescopes (Fig. ^J. The focal plane is equipped with CCD cameras, 
allowing for the measurement of the position and energy of each individual incoming 
photon (Fig. 0). This permits spatially resolved spectroscopy at medium resolution 
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(AE = 60— 140 eV), in the energy range between 0.3 and 10 keV. Spectroscopy at higher 
resolution is performed using gratings, the 'dispersed' spectrum being read by CCDs 
(Fig. [5J ■ Because this is dispersive spectroscopy, the sensitivity is degraded, the spatial 
information is essentially lost and the spectral resolution rapidly deteriorates as the 
extent of the source increases. The spectral resolution of the XMM grating instruments 
(XMM/RGS) is A(E)/E = 0.1 - 0.3% in the 0.3 - 2 keV energy band for point sources. 
Such spectroscopy is limited to clusters with sharply peaked emission profiles. It is 
restricted to the central # <J 2' region. The sensitivity of Chandra grating instruments is 
too low for cluster studies. 

The XMM-Newton and Chandra observatories are complementary. Chandra has an 
extremely good spatial resolution of A9 = 0.5" (compared to 8" for XMM-Newton). 
The strength of XMM-Newton is its exceptional collecting area and thus sensitivity: 
three high-throughput telescopes are operating in parallel. The Field of view is 30' in 
diameter, well adapted to cluster studies. Chandra has only one telescope, a smaller field 
of view of 17'X17' (for the ACIST instrument) and an effective area typically 3(5) times 
lower than XMM-Newton at 1.5(8) keV. 

As compared to the previous generation of satellites, XMM-Newton and Chandra 
represent a giant step forward in term of sensitivity and spatial resolution. The ROSAT 
satellite ^3] had good imagery capability (A9 = 15") but much lower effective area 
and very poor spectroscopic capability. The high energy cut-off of the telescope was 
E ~ 2 keV, so that accurate temperature measurement was limited to very cool clusters. 
ASCA was the first X-ray observatory [2J with telescopes working up to 10 keV and a 
CCD camera at the focal plane at one of the telescope (the other telescopes were equipped 
with proportional counters). As compared to spectroscopy made before with collimated 
spectrometers (like with the EXOSAT or GINGA satellite), the gain in sensitivity was 
very important. It was also the first time one could do spatially resolved spectroscopy of 
clusters. However, this was limited by the relatively large and energy dependent Point 
Spread Function. The spatial resolution of Beppo-SAX was better, but above all it had 
the capability of observing sources over more than three decades of energy, from 0.1 to 
200 keV |T£1- RXTE ^1 nas n0 imaging capability but was also used to study hard 
X-ray emission from clusters. 

With XMM-Newton and Chandra: 

• We can map the gas distribution in nearby clusters from very deep inside the core, 
at the scale of a few kpc with Chandra |17| . up to very close to the virial radius with 
XMM-Newton. Temperatures profiles (and thus mass profiles) can be measured over a 
wide radial range, down to 0.001i?2oo with Chandra JS] up to close the virial radius with 
XMM-Newton, even in low mass systems ^J]- Last but not least, we have now precise 
temperature maps for unrelaxed objects [201 12 1| and we can resolve very sharp density 
features |22j. 

• We can measure basic cluster properties up to high z (z ~ 1.3) and down to the 
ROSAT detection limit (with XMM-Newton). This includes morphology from images, 
gas density radial profile, global temperature and gas mass (e.g. |23U24p . As an example, 
Fig. □ HH show the Chandra and XMM-Newton observations of RDCS 1252.9-2927 at 
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Fig. 4. - Surface brightness profile of RDCS 
1252.9-2927 measured with Chandra (data 
points), with best-fit model (solid line) and 
residuals. Figure from |24| 



Fig. 3. - Image of the cluster RDCS 1252.9- 
2927 at z = 1.24. Contours of the Chandra 
emission are overlaid on a composite VLT im- 
age. Figure from [24] 
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Fig. 5. - X-ray spectrum of RDCS 1252.9-2927 
(data points) and best-fit thermal model (solid 
line) from XMM-Newton observation. Note 
the redshifted Iron line. Figure from 12 1| 



z = 1.24 [24] . Total mass and entropy can be derived assuming isothermality [21]. For 
the brighter distant clusters, crude temperature profiles [23] or maps [26] can be obtained. 

• We can find and identify new clusters Clusters at all redshifts appear as extended 
sources in XMM-Newton and Chandra images. However, only Chandra has the capability 
to perfectly remove point source contamination. The typical flux limit for XMM-Newton 
serendipitous surveys is 10 -14 ergs/s/cm 2 in the 0.5 — 2 keV band [2E|> about 3 times 
lower than with ROSAT. 
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3. Hierarchical Cluster Formation 

3'1. Substructures and merger events in the local Universe. - The early Einstein images 
showed that clusters present a large variety of morphology, from regular clusters to very 
complex systems with multiple substructures Since substructures in the ICM are 

erased on a typical time scale of a few Gyr, their detection is the signature of recent 
dynamical evolution. This manifold morphology indicates a variety of dynamical states 
in the local cluster population. These observations were consistent with the idea that 
clusters are still forming to-day, as expected in the hierarchical formation model. 

The presence of substructures by itself does not tell much about the cluster formation 
process. For instance, we expect substructures both if clusters form through mergers of 
smaller systems (CDM scenario), or by divisions of larger systems (top-down scenario). 
A direct support of the hierarchical CDM model was the observation of specific signa- 
tures of merger events in density and temperature maps, as predicted from numerical 
simulations (see the detailed reviews j3U] on the physics of merger events and |3J on 
X-ray observations of these events with ASCA and ROSAT). The first unambiguous 
signature of a merger event was provided by the ROSAT observation of gas compression 
in the subcluster of A2256, as expected if this subcluster was falling towards the main 
cluster |32|. The ASCA observation of a temperature increase in the interaction region 
between the sub-clusters of Cygn-A, was even clearer evidence that two sub-clusters were 
colliding It also showed that shocks induced by mergers do contribute to the ICM 
heating. This is is a strong prediction of numerical simulations of cluster mergers |34II35| . 
When two sub-clusters collide, the interaction region is heated and compressed. As the 
relative motion in mergers is moderately supersonic (v ~ 2000 kms/s), shocks are driven 
in the ICM. As a result, the gas of the final cluster is heated to its virial temperature, 
which is higher than the virial temperatures of the initial sub-clusters. 

3'2. Cluster formation at high z. - With XMM-Newton and Chandra we now extend 
the study of substructures to the distant Universe. As expected from the hierarchical 
formation models, we observe a variety of morphology (and thus dynamical state) up 
to very high z. A clear case of a double cluster is RX J1053. 7+5735, observed with 
XMM-Newton at z = 1.14 This cluster is very probably a merger between two nearly 
equal mass systems. In contrast, RXJ1226. 9+3332 (z = 0.9) observed with XMM-Newton 
is a massive cluster (the temperature is kT =11 + 1 keV), with a very regular morphology 
(Fig. [HJ indicative of a relaxed state |37]. Note that the existence of massive and relaxed 
clusters at so high z is expected in low fl Universe but is very unlikely in a critical 
= 1 Universe. We also see unambiguous evidence of merging activity up to z ~ 0.8 
(Fig. UJ . The crude Chandra temperature map shows a temperature increase between 
the two subclusters of RX J0152. 7-1357, indicating that they have started to merge 
Interestingly, when the two subunits will have completely merged the cluster will have 
the mass of Coma. 

3'3. The detailed physics of merger events and the effect of the Large Scale environ- 
ment. - With Chandra and XMM-Newton, the temperature structure in nearby clusters 
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Fig. 6. - RXJ1226.9+3332, a relaxed massive 
cluster at z = 0.89. The contours of X-ray 
emission detected by XMM-Newton is overlaid 
on a Subaru I-band image. Figure from |37) 
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Fig. 7. - The merging cluster RX J0152. 7-1357 
at z = 0.83. The Chandra X-ray contours 
are overlaid on a Keck II I-band image. Fig- 
ure from |26) . The Chandra temperature map 
shows a temperature increase between the two 
subclusters indicating that they have started 
to merge 



can be mapped with unprecedented accuracy. This permits much deeper investigations 
of the dynamical process of cluster formation. 

3'3.1. Shocks and Cold Fronts. We now understand better what happens during 
a merger event. The Chandra observation of the merging cluster 1E0657-56 provides a 
clear 'text-book' example of a shock l 3BJ as shown in Fig. |H1 and Fig. OH The derived 
Mach number is about M — 2 . Evidence of strong shocks is rare, but they might be 
difficult to detect due to projection effects More surprisingly Chandra revealed the 
presence of "Cold fronts" in merging clusters, first discovered in A2142 [22] and A3667 
[3*§| . Cold fronts are contact discontinuities between the cool core of a subcluster moving 
at near sonic velocity and the surrounding main cluster gas. Across the discontinuity 
there is an abrupt jump of the gas density and temperature (Fig. [5J|. The pressure is 
approximately continuous (Fig. EJ), which shows that the discontinuity is not a shock. 
1E0657-56 exhibits both a cold front and a shock, located ahead of the cold front (Fig. [HJ 
El). The observations of cold fronts show that the cool core of infalling subclusters can 
survive the passage through the main cluster core, whereas the gas in the outer region of 
the subcluster is stripped by ram pressure ■ Evidence of gas stripping during mergers 
is also provided by the observation of cool trails behind some merging subclusters as, 
for instance, in the XMM-Newton observation of A1644 Ultimately the cool core is 
destroyed: for instance the shape of the subcluster remnant in 1E0657-56 shows that it 
is being actively destroyed by gas-dynamic instabilities 38 . 
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Fig. 8. - Chandra image of 1E0657-56. Note 
the "bullet" apparently just exiting the clus- 
ter core and moving westward. The bullet is 
preceded by an X-ray brightness edge that re- 
sembles a bow shock. Figure from 
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Fig. 9. - Surface brightness profile in a 120 ° sector centered on the bullet and directed westward 
(Left) and corresponding temperature and pressure profile (right). The first edge is a cold front 
while the second is a shock. Figure from |38| 



The observation of cold fronts has further implication for the physics at play in clus- 
ters. The very steep temperature gradient and smooth surface of the cold fronts imply 
that thermal conduction and Kelvin-Helmotz instabilities are suppressed, probably by 
magnetic fields |3I1 1551 133] . 

3'3.2. Formation history and dynamical state. It appears clearly now that the dy- 
namical state of clusters does not always simply depend on the most recent merger event. 
Multiple on-going merger events are seen in several clusters, for instance in Coma 05] and 
A2744 44 . The gas relaxation time can be longer than the interval between successive 
merger events, especially in dense environments |21j . For instance, the XMM-Newton 
observation of the double cluster A1750 [2JJ, located in a super-cluster, revealed an in- 
crease of temperature in the region between the two sub-clusters A1750 C and A1750N 
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Fig. 10. - The double cluster A1750. The con- 
tours of the XMM-Newton emission are over- 
laid on the temperature map. An arc-like 
hot region can be seen between the two sub- 
clusters, A1750N in the north and A1750C 
in the South. The two clusters have started 
to collide and the gas is being shocked and 
compressed. The hot regions in A1750 C are 
regions of shocked gas from an older merger 
which occurred about 1-2 billion years ago. 
Figure from |21|. 
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Fig. 11. - The merging cluster A3921. The 
contours of the XMM-Newton emission are 
overlaid on the temperature map. In con- 
trast to A 1750, the hot shocked region is not 
perpendicular to the line joining the centre 
of the two subclusters, but nearly parallel to 
it. This is interpreted in terms of an off-axis 
merger: the less massive sub-cluster infalling 
from the SE has already passed the core of the 
main cluster with a non-null impact parame- 
ter. Figure from [48 1 



fFig. imp. This indicates that they have started to merge. However, the interaction is 
too recent to explain the temperature substructures observed in A1750C fFig. HOp . They 
are likely to result from a more ancient merger. 

Various observations confirm the importance of the cluster large-scale environment 
on its formation history. We have now unambiguous evidence that sub-clusters are 
accreted preferentially along the direction of filament (s) connecting to the cluster, as 
expected in the standard scenario of cluster formation. It is provided, for instance, by 
the observations of A85 02] i A1367 0f>| an d Coma 07|02|- The merger of sub-clusters 
can occur with an non-zero impact parameter, e.g. in A3921 (Fig. [nj 4§j) and Coma 
|43j . probably as a result of large-scale tidal torques. 

3'4. Statistical studies of cluster morphology. - Beyond the detailed study of spe- 
cific clusters, statistical analysis of cluster morphology provide important information 
on structure formation. Till now, such studies have been restricted to X-ray images. 
They will certainly be extended in the future to temperature maps, now available with 
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XMM-Newton and Chandra. These studies first require to design quantitative measure- 
ments of substructures. This is not trivial and many estimators can be used: centre-shifts 
[4"§| , power-ratios [SHj , j3 and Lee statistics [5T] .... The power-ratio method is of par- 
ticular interest, because power ratios are closely related to the cluster dynamical state 

Substructures are common in nearby clusters. A systematic study of substructures 
in 470 clusters detected in the Rosat All Sky Survey show that ~ 50% of clusters have 
significant substructures |51| . Similar numbers are obtained from earlier Einstein obser- 
vations 49, 29, 52 or ROSAT pointed observations [5U|. A positive correlation is observed 
between the presence of substructures and the density of the cluster environment (the 
number density of surrounding clusters), as expected in the hierarchical scenario |51|. 
An anti-correlation is observed with the presence of a cooling core jSU] |2] > suggesting 
that cooling cores are destroyed during mergers. 

The power ratio technique was used to quantify the evolution of morphology with 
redshift, using a sample of 40 distant clusters observed by Chandra. As expected from 
hierarchical models of structure formation, high-redshift clusters have more substructures 
and are dynamically more active than low-redshift clusters |5.'ij . 

3'5. Mergers and non thermal emission. - The ICM is not only filled with hot thermal 
gas. The presence of a large scale magnetic field and relativistic electrons is revealed by 
radio observations of diffuse synchrotron emission: regular, centrally located, radio halos 
and/or irregular, peripherally located, radio relics |541 for a review]. Further evidence is 
provided by the detection of non thermal hard X-ray emission, interpreted as coming from 
the inverse Compton scattering of the cosmic microwave background by the relativistic 
electrons (see, e.g., the observation of A2256 with Beppo-SAX [53] and RXTE [56]). Note 
that the signal to noise ratio of such observations is still low and their interpretation 
ambiguous |55| . 

It has been known for some time that radio halos are associated with non relaxed 
clusters, as expected, e.g., if relativistic electrons are (re)-accelerated by merger shocks. 
For instance, there is a correlation between the dipole power ratio (a measure of the 
departure from the virialised state) and the power of the radio halo 57,. However, the 
origin and acceleration mechanism of the relativistic electrons is uncertain |58j . A recent 
comparison of the radio and Chandra temperature maps of merging clusters | 59 | suggests 
that radio halo electrons are mostly accelerated by the turbulence induced by mergers, 
rather than directly by shocks. However, when strongly supersonic shocks are present, 
they can also contribute to the acceleration [SUj. First direct evidence of turbulence in 
the ICM was obtained recently from the XMM-Newton pressure spectrum of Coma IHO] . 

4. Structural and scaling properties of the cluster population 

4T. The cluster population. - A wide range of morphological and physical properties 
are observed in the galaxy cluster population. As mentioned above, substructures are 
common in galaxy clusters, with a rich variety in the type and scale of subclustering. 
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The total mass of clusters ranges from 10 13 Mq for groups up to a few 10 15 M Q for 
very rich systems, while the X-ray luminosity of the hot intra-cluster medium varies by 
orders of magnitude, from 10 41 to a few 10 45 ergs/s, and the observed temperature from 
typically 0.3 up to ~ 15 keV. 

However, clusters do not occupy the whole parameter space of physical properties. We 
do see strong correlations between observed physical characteristics of nearby clusters, 
like luminosity, gas mass, total mass, size and temperature 61, 62 63, 64 . Furthermore 
there is also some regularity in shape. If one excludes major merger events or very 
complex systems (~ 20% of systems), the cluster morphology is usually dominated by a 
regular centrally peaked main component. In that case, the surface brightness, outside 
the cooling flow region, is reasonably fitted by a /3-model, once minor substructures (like 
small secondary subclusters falling onto a main virialised component) are excised 65,. 

4'2. The self- similar model of cluster formation. - Regularity in the cluster population 
is expected on theoretical grounds. The simplest models of structure formation, purely 
based on gravitation, predict that galaxy clusters constitute a self-similar population. 
In this section, I briefly summarize the main characteristics of such models. Further 
enlightening discussions on cluster formation can be found in |66j and in [2] and in other 
reviews in this book. 

As clusters are dark matter dominated objects, their formation and evolution is driven 
by gravity. The hierarchical collapse of initial density fluctuations of dark matter, which 
produces the cluster population, is a complex dynamical phenomena. It was first modeled 
using simple spherical collapse models, while a full treatment of the 3-D hierarchical 
clustering is now made with N-body simulations. From these theoretical works, key 
characteristics of the cluster population emerge: 

•Major merger events are rare. At a given time, a cluster can be seen as a collapsed halo 
of dark matter, which is accreting surrounding matter and small groups. The virializcd 
part of the cluster corresponds roughly to a fixed density contrast 5 ~ 200 as compared 
to the critical density of the Universe, p c (z) at the considered rcdshift: 



where My and Rv are the 'virial' mass and radius. p c (z) — h 2 (z)3Hq / (8irG) , where h(z) 
is the Hubble constant normalized to its local value: h 2 (z) = Q m (l + z) 3 + Ha, where 
O m is the cosmological density parameter and Ha the cosmological constant 
•There is a strong similarity in the internal structure of virialised dark matter halos. 
This reflects the fact that there is no characteristic scale in the problem. 

The gas properties directly follow from the dark matter properties, assuming that the 
gas evolution is purely driven by gravitation, i.e., by the evolution of the potential of the 
dark matter. 

• The gas internal structure is universal, as this is the case for the dark matter (Fig.^J). 
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Fig. 12. - Universal cluster density profiles ob- 
tained from numerical simulations of structure 
formation. The radius is scaled to the virial 
radius, corresponding to a density contrast of 
200 as compared to the mean density of the 
Universe in which the cluster is embedded. 
The Dark matter and gas densities are nor- 
malized to this density. Figure from |67|. 
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Fig. 13. - Virial mass-temperature relation- 
ship for simulated clusters. When the mass is 
scaled by h(z), it varies as T 3 ' 2 independent 
of redshift and cosmology. Figure from |73|. 



• The gas, within the virial radius, is roughly in hydrostatic equilibrium in the potential 
of the dark matter. The virial theorem then gives: 

(5) ^W^ kr 

where T is the gas mean temperature and /3t is a normalization factor, which depends 
on the cluster internal structure. Since this structure is universal, /3r is a constant, 
independent on z and cluster mass. 

• The gas mass fraction / gas reflects the Universe value, since the gas 'follows' the collapse 
of the dark matter. It is thus constant: 

(6) / sas = ^ = est 

\ J ./gas ^ 

Therefore, X-ray clusters are expected to exhibit self-similarity: 

• Each cluster is defined by two parameters only: its mass and its redshift. Because 
masses are difficult to measure, it is traditional to rather use the cluster temperature. 
From the basic equations Eq. 14161 one can derive a scaling law for each physical property, 
Q, of the form Q cx A(z)T a , that relates it to the redshift and temperature. The evolution 
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factor, A(z), in the scaling relations is due to the evolution of the mean dark matter (and 
thus gas) density which varies as the critical density of the Universe: 

(7) oc pdm = Sp c (z) oc h 2 (z) 

For instance, the total and gas mass scales as M gas oc My oc h^ 1 (z)T 3 ^ 2 fFig. 1 1 3|) . the 
virial radius as Ry oc h^ 1 (z)T 1 ^ 2 . The X-ray luminosity is Lx — J p 2 as dVA(T), where 
A(T) oc T 1 / 2 , assuming bremsstrahlung emission. It thus scales as Lx oc h{z)T 2 . 
• The radial profile of any physical quantity (e.g gas or dark matter density, gas tem- 
perature) can be expressed in scaled coordinates. The considered quantity is normalized 
according to the scaling relation estimated at the cluster temperature and redshift and 
the radius is expressed in units of the virial radius. Then, the scaled radial profiles are 
the same for all clusters, whatever their redshift or their temperature (see Fig. 11211 . 

The self-similar model has been validated with numerical simulations of large scale 
structure formation that include the gas dynamics but only gravitation |671 1681 1691 170] 
and by similar semi-analytical hierarchical models |71|. However, there is some ambiguity 
in the definition of the 'virial' mass of a cluster, or equivalently in the value of the density 
contrast S used in Eq. 21 This is fully discussed in |2] (see also [72] )■ In the spherical 
top-hat model, where a cluster is considered as a spherical perturbation which has just 
collapsed, the boundary of a cluster is well defined. It corresponds to S = 18ir 2 ~ 200 in 
a critical density Universe (SCDM model), while S is a function A(z, f2 m , Da) of redshift 
and £! m ,J7A in a ACDM cosmology [55] . In reality, there is no such thing as a strict 
boundary between a relaxed part of a cluster and an infall region. Recent numerical 
simulations 73 , 66 indicate that tighter scaling laws are obtained if one uses a common 
value, S ~ 200, for all redshift and cosmologies (Fig. 1 1 3|> . Nevertheless, the spherical 
model definition is often used in the literature. In that case, note that the expected 
evolution factor of the scaling laws is different: h 2 (z) is replaced by A(z, fi m , Sl\)h 2 (z) 
(Eq.0. 

Since the first X-ray observations of clusters, the statistical properties of the observed 
cluster population have been compared with these theoretical predictions. This has 
provided valuable insight into the physics that governs the large scale structure formation 
and evolution of both the Baryonic and the Dark Matter components. 

4'3. The Dark matter in local clusters. - 



4'3.1. Theoretical predictions. Recent high resolution simulations predict that Cold 
Dark Matter profiles are cusped in the center |741 1751 1761 177] . An example is the NFW 
profile [7j] given by 

, B s / s Pc(z)S c 

(8) Pdm(?") - 



(cr/i?20o)(l + cr/R 2 oo[ 



2 



where pdm^) is the mass density, i?2oo is the radius corresponding to a density contrast of 
200 (roughly the virial radius) and c is a concentration parameter. S c is the characteristic 
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Fig. 14. - Integrated total mass profiles for a 
sample of clusters in the temperature range 
2-9 keV measured with XMM-Newton. The 
mass is scaled to M200, and the radius to -R200, 
both values being derived from the best fit- 
ting NFW model. The solid line corresponds 
to the mean scaled NFW profile and the two 
dashed lines are the associated standard devi- 
ation. Figure from |85| . 
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Fig. 15. - Concentration parameter C200 versus 
the cluster mass M200 (derived from fitting a 
NFW model to the data presented Fig. 1141 . 
The solid line represents the variation of C200 
for clusters at z — from numerical simula- 
tions |78) . The dotted lines are the standard 
deviation associated with this relation. The 
dashed line represents the same relation at a 
redshift of z = 0.15( the maximum redshift for 
the sample). Figure from |85|. 



dimensionless density, related to the concentration parameter by <5 C = (200/3)c 3 /[in(l + 
c) — c/(l + c)]. If c was a constant, the profiles of all clusters would be perfectly self- 
similar. Actually, it is expected to vary slightly with z and system mass [THEE]- The 
corresponding integrated mass profile is of the form: M{r) = M2oom(cr / R200) /m(c) , 
where M200 is the mass enclosed within -R200 (the 'virial mass'). This mass profile can 
be directly compared with observations. 

The NFW density profile varies from pdm oc r -1 at small radii to pdm oc r~ 3 at large 
radii. The exact slope of the dark matter profile in the center is still debated (see [H\ [7S] 
for latest results). For instance, the simulations in (2^1 give a steeper slope /9dm oc r~ 1,5 . 



4'3.2. Observed mass profiles. With Chandra and XMM-Newton we can now measure 
precisely the total mass distribution in clusters (from Eq.|3l over a wide range of radius, 
from ~ 0.001 i?200 UM U P to ~ 0.7 i?2oo EH- The validity of the X-ray method to derive 
masses from the hydrostatic equilibrium equation has been validated for relaxed clusters, 
by comparing Chandra mass estimates with independent lensing mass estimate [811 and 
reference therein]. The observed mass profiles are well fitted using a NFW model. This 
is true not only for massive clusters [HOI El EH E3 EH EH El) but also for low mass 
systems [7|E1. A profile with a flat core is generally rejected with a high confidence level. 
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Relation 


Slope 


Reference 


Note 


Lx-T 


2.64 ± 0.27 


eh 


a 




2.88 ±0.15 


inn 




M gas -T 


1.98 ±0.18 


E2 


c 




1.71 ±0.13 


P) 






1.89 ±0.20 


EH 


c 




1.80 ±0.16 


94 


c 


EM-T 


1.38 


E3 




S-T 


0.65 ± 0.05 


.108 


d 


/gas - T 


0.34 ± 0.22 


P) 


c 




0.66 ±0.34 


P) 


e 



Table I. - Logarithmic slope of local scal- 
ing laws from the literature (not exhaustive). 
In the standard self-similar model: Lx oc T 2 , 
M g as oc T 1 ' 5 , EM oc T - 5 and S oc T. Notes: 
a: Corrected for cooling flow contribution; b: 
non coohng-flow clusters; c: Gas mass inte- 
grated within R500; d: entropy estimated at 
O.I-R200; e: /gas integrated within -Riooo- 



In a few cases ^JIESIE]) the inner slope has even been measured precisely enough( 3 ) 
to distinguish between various predictions. They favor a slope of a ~ — 1. This steep 
density profile in the center of clusters is incompatible with the flattened core DM profiles 
predicted by self-interacting Dark Matter models 82, 83 . In one case, the validity of the 
NFW model was tested up to the virial radius |79 |. 

Recently, the first quantitative check of the universality of the mass profile was per- 
formed with XMM-Newton OIHS]. As shown Fig. [HI the mass profiles scaled in units 
of i?200 and M200 nearly coincide, with a dispersion of less than 15% at 0.1 i?200- Fur- 
thermore, the shape is quantitatively consistent with the predictions. The derived con- 
centration parameters are consistent with the C-M200 relation derived from numerical 
simulations for a ACDM cosmology (Fig. I15|) . 

This excellent quantitative agreement with theoretical predictions provides strong 
evidence in favor of the Cold Dark Matter cosmological scenario, and shows that the 
physics of the Dark Matter collapse is well understood, at least down to the cluster scale. 

Finally, it is worth mentioning again the observation of 1E0657-56, which yields very 
interesting, independent constraints on the Dark Matter. A comparison of the gas, galax- 
ies and weak-lensing maps demonstrates the presence and dominance of non-baryonic 
Dark Matter in this cluster and shows that the cross-section of the dark matter particles 
is low, excluding again most of the self-interacting dark matter models |86l 187) . 

4'4. Gas properties in local clusters. - It has been known for nearly 20 years that gas 
properties deviate from the standard self-similar model predictions: the Lx~T relation is 
steeper than expected. This was the first indication that the gas physics should be looked 



( 3 ) Note that such measurements are not limited by the instrument capabilities but by the 
number of suitable targets. In most clusters the very center is disturbed (see below), invalidating 
the HE approach to compute the central mass profile. 
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Fig. 16. - The relation between the bolometric 
luminosity and the temperature. Only clus- 
ters without strong cooling in the center are 
considered. The data are mostly from GINGA 
observations. The relation is steeper than 
expected in the standard self-similar model: 
L x cx T 2 - 88±0 ' 15 . Figure from |£T]. 



Fig. 17. - The relation between the gas mass 
and the temperature. The gas mass was 
measured with ROSAT and estimated within 
iisoo- The temperature was measured with 
the ASCA or earlier missions. The relation 
is steeper than expected in the standard self- 



similar model: 
from EU. 



T 



Figure 



at more closely and that non-gravitational processes could play a role (e.g. [HH|)- We have 
now a much more precise view of cluster properties. The new picture that emerged from 
recent observations is that local clusters, down to remarkably low mass (kT ~ 2keV), do 
obey self-similarity. However, the scaling laws differ from simple expectations. In this 
part, I summarize these observations, focusing on clusters above a typical temperature 
of 2 keV. Below that temperature, one typically finds groups. The latest results indicate 
that their properties may follow the trends observed in clusters, albeit with a significantly 
increased dispersion 89 . For a recent review on group properties, the reader may refer 

to ism. 

4'4.1. Local scaling laws. Scaling laws relate various physical properties with T. 
This includes global gas properties like the X-ray luminosity Lx [21 EH] > the gas mass 
M gas [H2 H3 EH H31 or the gas mass fraction / gas |H3 ■ The corresponding scaling 
relations are steeper than expected (Tig. I16I17|) . However, an unambiguous picture can 
be obtained only by studying the internal structure of clusters. For instance, a steepening 
of the Lx~T relation could be due to a systematic increase of the mean gas density with 
T (a simple modification of the scaling laws) or to a variation of cluster shape with T (a 
break of self-similarity). By looking at radially averaged profiles of interesting quantities, 
particularly the density and the entropy, two issues can be addressed at the same time. 
The first is to see whether the profiles agree in shape; the second is to investigate the 
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Fig. 18. - Left: Surface brightness profile of 15 nearby (0.04 < z < 0.06) and hot clusters 
(kT > 3.5 keV) observed with ROSAT. Middle: Scaled emission measure profiles. The radius 
is normalized to the virial radius and EM is scaled according to the classical laws {EM oc T 1 ^ 2 



or equivalently Af gas oc 
adapted from |rj5M95| . 



T 3/2 ). Right: Same assuming M gas oc T 1 ' 94 , or EM oc T 1 ' 38 . Figure 



scaling of the profiles. The various scaling relations are summarized in Tabled 

4'4.2. Gas density profiles. The gas content and density distribution can been studied 
through the emission measure along the line of sight EM(r) — J^ 200 n% all, which is easily 
derived from the X-ray surface brightness profile (Sec. 12 21 Fig. I18|) . The scaled EM 
profiles of hot clusters measured with ROSAT were found to be similar in shape outside 
typically 0.1 — 0.2i?2oo |65H92ll53) . There is a large dispersion in the central region, linked 
with the presence of a cooling core (see Sec. I4'4.5T) . Outside that region the universal 
profile is well fitted by a /3-model, with (3 = 2/3 and R c ~ 0.12i?2oo |127| . Note however 
that more precise XMM-Newton data on hot clusters gives a steeper density profile in 
the center, even for weak cooling flow clusters This is a consequence of the cusped 
nature of the dark matter profile. 

In the standard self-similar framework, the mean gas density does not depend on 
the temperature (Eq. [71). The emission measure is thus expected to scale as -R20O1 i-e 
EM oc T 05 . The scatter in the scaled EM profiles was found to be considerably reduced 
if a much steeper EM — T relation, EM oc T 1 - 38 , was used to scale the profiles [23 
see Fig. I18| . This explains the steepening of the Lx~T relation and translates into 
M gas T 194 , which is consistent with the observed steepening of the M gas -T relation 
(Table [I| as compared to the standard M gas oc T scaling. A test case study with 
XMM-Newton of a cool cluster (A1983) suggests that clusters follow the universal scaled 
EM profile down to temperature as low as kT ~ 2 keV [7] . 

There is converging evidence that the gas distribution in clusters is more inflated than 
the dark matter distribution. The integrated gas mass fraction increases with scaled 



20 



M. ARNAUD 



radius 96 . The increase, measured with Beppo-SAX is about 25% between a density 
contrast of 6 — 2500 (about 1/3 of the virial radius) and S = 500 The results 

at 5 = 500 were largely based on extrapolations and could be biased. However, recent 
XMM-Newton and Chandra results seem to confirm this trend. From a sample of six 
massive clusters observed by Chandra, an average / gas value of 0.113 ± 0.013 was derived 
at S = 2500 |HH1 ■ At this density contrast, the value derived from the XMM-Newton 
analysis of A1413 is / gas = 0.11, perfectly consistent with Chandra results j7j3 . The 
XMM-Newton data extends up to 6 = 500 and show that / gas keeps increasing with 
radius to reach / gas ~ 0.14 at S — 500 (25% increase). 

4'4.3. Temperature profiles. There is also a similarity in the temperature profiles 
of hot clusters observed with ASCA and Beppo-SAX beyond the cooling core region 
|991 11001 HOT) . In relaxed clusters, there is usually a drop of temperature towards the 
center (r <J 0.1i?2oo)- This corresponds to the cooling core region (see Sec. I4'4.5|) . There 
is also a tendency for clusters with cooling cores to have flatter temperature profiles at 
large scale than non cooling core clusters, suggesting that the profile shape depends on 
the cluster dynamical state |101| . 

A XMM-Newton study of an unbiased sample of clusters shows a variety of shapes, 
probably linked to various dynamical states |102j . The self-similarity of shape seem to be 
confirmed by Chandra jSTJ I103j and XMM-Newton data |104l I105| for relaxed clusters. 
However, no consensus has been reached yet on the exact shape of the profiles. This 
was already the case for ASCA and Beppo-SAX studies and this is still the case with 
XMM-Newton and Chandra. Some studies find relatively flat profiles (within ~ 20%), 
beyond the cooling core region, up to 0.3i?2oo EI] or even to 0.5i?20O |104| . Other studies 
found steadily decreasing profiles, by 30% between O.I-R200 and 0.5i?2oo |105j or even by 
50% |TU3| . 

4'4.4. Gas entropy. The studies of the gas density profiles suggest that the departures 
of the gas scaling laws from the standard self-similar model are due to a non-standard 
scaling of the mean gas density with temperature and not to a break of self-similarity. 
To understand the physical origin of these deviations, one must consider the gas entropy 
rather than the density. The 'entropy' is traditionally defined as S = YTjn 2 J^ and is 
related to the true thermodynamic entropy via a logarithm and an additive constant. It 
is a fundamental characteristic of the ICM, because it is a probe of the thermodynamic 
history of the gas |1061 12] • The entropy profile of the gas and the shape of the potential 
well, in which it lies, fully define the X-ray properties of a relaxed cluster ( 4 ). In the 
standard self-similar picture, the entropy, at any scaled radius, should scale simply as 
Sex h(z) 4 / 3 T. 

Since the pioneering work of |107| . it is known that the entropy measured at O.I-R200 
exceeds the value attainable through gravitational heating alone, an effect that is es- 



( 4 ) The gas density and temperature profiles can be determined from the entropy profile and 
the hydrostatic equation (Eq.^J with a boundary condition at the virial radius 
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Fig. 19. - Gas entropy at 0.1 -R200 versus clus- 
ter temperature. Data of individual clusters, 
derived from ROSAT and ASCA observations, 
have been grouped by temperature bins. The 
dotted line shows the standard self-similar pre- 
diction, S oc T. The relation is shallower than 
expected in the standard self-similar model: 
S oc r°' 65±0 ' 05 . Figure from [TUS| . 
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Fig. 20. - XMM-Newton scaled entropy pro- 
files. The temperature ranges from 2 keV to 
6.5keV. The entropy has been scaled follow- 
ing the empirical relation, S oc h(z)' 4/3 T - 65 . 
The radius is scaled to the measured virial ra- 
dius, i?200- Figure from |19| . 



pecially noticeable in low mass systems. Various non-gravitational processes have been 
proposed to explain this entropy excess, such as heating before or after collapse (from 
SNs or AGNs) or radiative cooling. A recent study of 66 nearby systems observed by 
ASCA and ROSAT |l(J8j shows that the S-T relation follows a power law but with a 
smaller slope than expected: the entropy measured at 0.1i?2oo scales as S oc T 65 (see 
Fig. 1191) . High quality XMM-Newton observations |7l 1191 see Fig. ESI show a remark- 
able self-similarity in the shape of the entropy profiles down to low mass (kT ~ 2 keV) . 
Stacking analysis of ROSAT data gives the same results |108| . Except in the very cen- 
tre, the XMM-Newton entropy profiles are self-similar in shape, with close to power law 
behavior in the 0.05 -R200 < r < 0.5 R200 range. The slope is slightly shallower than 
predicted by shock heating models, S(r) oc r °- 94 ± 014 . The normalization of the profiles 
is consistent with the S oc T 65 scaling. Similar results were obtained more recently on 
a larger sample observed with XMM-Newton |105| . 

The self-similarity of shape of the entropy profile is a strong constraint for models. 
Simple pre-heating models, which predict large isentropic cores, must be ruled out. In 
addition to the gravitational effect, the gas history probably depends on the interplay 
between cooling and various galaxy feedback mechanisms (see [5| and other reviews in 
this book). 

4'4.5. The complex physics in cluster core. As mentioned above, there is a very large 
dispersion in the core properties, within typically O.I-R200 This is linked to the complex 
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physics at play in the cluster center. In the center of clusters the gas density is high. 
The cooling time, which scales as i coo i oc T 1 / 2 /n c can be shorter than the 'age' of the 
cluster, the time since the last major merger event. We thus expect the temperature 
to decrease due to radiative cooling and the density to increase so that the gas stays in 
quasi hydrostatic equilibrium. Both Chandra [HI] and XMM-Newton |1()9| observations 
clearly confirm this temperature drop in cooling clusters. These new observations have 
also dramatically changed our vision of cooling cores in clusters. This topic has been the 
subject of a recent conference |110| and is only briefly discussed here. 

A major surprise is the lack of very cool gas, inconsistent with standard isobaric 
cooling flow models. The XMM-Newton/RGS high resolution spectra exhibit strong 
emission from cool plasma at just below the ambient temperature, T, down to T/2, but 
also exhibit a severe deficit of emission compared to the predictions at lower temperatures 
[TTT| . The standard model is also inconsistent with XMM-Newton/EPIC data [TT21 
11131 1109| . In parallel, arc-second spatial imaging with Chandra have revealed complex 
interaction between AGN activity in the cluster center and the intra-cluster medium 114 
for a review]. One observes X-ray cavities or 'bubbles', created by the central AGN radio 
lobes as they displace the X-ray gas. They are usually surrounded by cool rims and not by 
shocked gas. However, a shock was recently discovered at the boundary of the cavities 
in MS0735. 6+7421 |115| . There are also 'ghost' cavities, probably bubbles that have 
buoyantly risen away from the cluster center. Spectacular examples of such phenomena 
are observed in the center of the Perseus cluster |1161 1117] , In this cluster ripples are 
also observed in the X-ray surface brightness of the gas surrounding the central bubble. 
This was interpreted as resulting from the propagation of weak shocks and viscously 
dissipating sound waves due to repeated outbursts of the central AGN |117| . Cold fronts 
and sloshing gas are also observed with Chandra in the center of relaxed clusters [1181 
for a review] . 

Whether and how both phenomena, the absence of very cool gas and AGN/ICM in- 
teraction, are connected is still unclear [21 11191 fl2(JI for reviews]. For instance, AGN 
heating may limit cooling but conduction could also play a role in heating the central 
region. A better understanding of cooling and AGN heating in the central part of clusters 
has further implications because both phenomena play a role at larger scales in clusters 
and during galaxy formation. Finally, the core properties have also a substantial impact 
on the Lx~T relation. When a cooling core is present, the luminosity is boosted (due to 
the peaked density profiles) and the mean temperature is decreased (due to the temper- 
ature drop in the center). This results in a large dispersion in the Lx~T relation. The 
dispersion is decreased when these effects are corrected for [jH] . Then the Ly^-T relation 
is the same as for clusters without strongly cooling cores |61| . 

4'4.6. The M-T relation. The M-T relation is a fundamental scaling relation. 
Since other scaling relations are expressed in terms of the temperature T, the M-T 
relation provides the missing link between the gas properties and the mass. Furthermore, 
estimations of the cosmological parameters from cluster abundances in clusters, or from 
the spatial distribution of clusters, heavily rely on this relation to relate the mass to 
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Fig. 21. - The M 250 q-T relation ob- 
served with Chandra for hot clusters. 
Solid line: The best fit power law, 
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Fig. 22. - The M 5Q0 -T relation observed with 
XMM-Newton. Solid line: The best fit power law 
for the whole sample. Dotted line: same for the hot 
cluster subsample. Dashed line: the predicted rela- 
tion from adiabatic numerical simulations |68|. Long- 
dashed line: the relation derived from a numerical 
simulation including radiative cooling, star formation 
and SN feedback H^J. Figure from [TH] . 



the observables available from X-ray surveys (see Sec l5'3|l . A sustained observational 
effort to measure the local M-T relation has been undertaken using ROSAT, ASCA 
and Beppo-SAX, but no definitive picture had emerged. Does the mass scale as T 3 / 2 as 
expected |121l 1931 194j ? Is this true only in the high mass range (kT J> 4 keV), with a 
steepening at lower mass [1221 11231 Itj3] ? Is the slope higher than expected over the entire 
mass range [5^]? This was unclear. The normalizations of the M-T relation derived from 
ASCA data are generally lower than predicted by adiabatic numerical simulations |68|. 
by typically 40% [1221 loT?] . On the other hand, using Beppo-SAX data, a normalisation 
consistent with the predictions was obtained, albeit with large error |93|. 

These studies had to rely largely on extrapolation to deduce the virial mass, and they 
were limited by the low resolution and the statistical quality of the temperature profiles. 
As shown above, significant progress on mass estimates has been made with Chandra and 
XMM-Newton. A first Chandra study [ST] of five hot clusters (kT > 5.5 keV) derived a 
M-T relation slope of 1.51 ± 0.27, consistent with the self-similar model (see Fig. l21|l . It 
confirmed the offset in normalisation. However, due to the relatively small Chandra field 
of view, the M-T relation was established at i?2500j i-e., about ~ 0.3i?2oo- More recently, 
the M-T relation was established down to lower density contrasts (6 = 2500 to S = 200) 
from a sample of ten nearby relaxed galaxy clusters covering a wider temperature range, 
[2 — 9] keV |124| . The masses were derived from precise mass profiles measured with 
XMM-Newton at least down to 6 — 1000 and extrapolated beyond that radius using the 
best fitting NFW model. The M2500-T for hot clusters is perfectly consistent with the 
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Chandra results. The logarithmic slope of the M-T relation was well constrained. It is the 
same at all 6, reflecting the self-similarity of the mass profiles. At S — 500 (see Fig. 12211 . 
the slope of the relation for the sub-sample of hot clusters (kT > 3.5 keV) is consistent 
with the standard self-similar expectation: a = 1.49 ± 0.15. The relation steepens 
when the whole sample is considered, but the effect is fairly small: a — 1.71 ± 0.09. 
The normalisation of the relation differs, at all density contrasts from the prediction of 
gravitation based models (by ~ 30%). Models that take into account radiative cooling 
and galaxy feedback [1251 1126| are generally in better agreement with the data |1241 for 
full discussion]. 

4'5. Evolution of cluster properties. - The standard self-similar model makes strong 
predictions for the evolution of cluster properties. Distant clusters should have the 
same internal structure as nearby clusters, but they should be denser, smaller and more 
luminous (Sec. I4'2|l . Physical properties derived from X-ray observations, as well as 
theoretical predictions, depend on the assumed cosmology. Fortunately, we have now 
concordant constraints (Sec. |5j) on cosmological parameters, showing that we live in a 
flat low density Universe (f2 m ~ 0.3, J7a ~ 0.7). This greatly simplifies the issue of cluster 
formation and evolution. 

The RO SAT and ASCA observations gave the first indication that the self-similarity 
does hold at z > 0. The universal emission measure profile appears to extend to z ~ 0.8, 
with a redshift scaling consistent with the expectation for a ACDM cosmology |127| . A 
significant evolution in the normalisation of the Ly^-T relation was obtained, consistent 
with the self-similar model. This was also the case in other studies made assuming this 
cosmology ( 5 ) |128M129| . although more recently no significant evolution was detected in 
a large sample of 79 clusters |130| . However, all these observations were highly biased 
towards massive systems, mostly clusters discovered by the EMSS, and their statistical 
quality was poor. 

With XMM-Newton and Chandra, we can now make high quality studies of dis- 
tant clusters from X-ray samples assembled using ROSAT observations (Sec. I5"3.1fl . 
These new samples cover a much wider mass range than the EMSS survey. Recent 
XMM-Newton and Chandra evolution studies do confirm that clusters follow scaling laws 
up to high z |131l 11321 125) . The self-similarity of shape was also confirmed on a test 
case at z = 0.6 |23j . However, the amount of evolution remains uncertain. For in- 
stance, depending on the data considered and the analysis procedure, the normalization 
the Lx~T relation has been found to evolve more than expected in the standard model, 
as expected or less than expected. The standard self-similar model predicts that the 
normalization A(z) varies as h(z), A{z) = h(z) ~ (1 + z)°- 6 ~°- 9 for the favored ACDM 
cosmology. The first studies comparing respectively Chandra and XMM-Newton 

|132j data with the local relation measured with ASCA jSJ gave similar evolution fac- 



( 5 ) Previous studies assumed a SCDM cosmology and found no evolution of the Lx~T relation. 
As discussed in |127| . this is an artifact due to the choice of a 'wrong' cosmology: the distance 
and therefore the luminosity are underestimated 
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Fig. 23. - Correlation between bolometric 
luminosity of distant clusters and tempera- 
ture. Filled circles: XMM-Newton data for 
SHARC clusters |132| ; Open circles: Chandra 
data |131| . When the luminosity is scaled by 



\3/2 



the data points are consistent with 



the local Lx-T relation (solid line), indicating 
evolution. Figure adapted from |132M13l1 




Fig. 24. - Entropy measured at O.I-R200 versus 
temperature for a sample of distant clusters 
observed with Chandra. Data for clusters in 
various z bins are indicated by different sym- 
bols. The entropy is rescaled according to the 
expected evolution. Dashed line: local S—T 
relation as measured by |108| . Figure from 

H31 



tors: A(z) = (1 + z) 152±0 - 26 and A(z) = (1 + z) 15 * 30 respectively. This is larger than 
expected. A more recent study [35], using a larger set of Chandra data and the same lo- 
cal reference but a different procedure, gives A(z) = (1 + z )°- 62±0 - 28 , perfectly consistent 
with the expectation. In contrast, a smaller evolution than expected was obtained when 
using the local relation measured with Beppo-SAX: A(z) = h(z)(l + z )- 104 ± - 32 . 

The evolution of the other scaling relations are still poorly constrained. The evolution 
of the M-T relation is consistent with the expectation [23]. There is some hints that 
the evolution of the M gas -T relation is smaller than expected [1311 125) . while the S—T 
relation would be higher than expected |2S1 see Fig. [2]. However, the effects are not 
very significant. 

This illustrates the difficulty in studying cluster evolution. High precision is required 
because the evolution expected in the 'reference' standard model is small. The normal- 
ization of the key Lx~T, M-T and S-T relations evolves as h(z), h(z)^ 1 and h(z)~ 4 / 3 
respectively, where h(z) is the Hubble constant. The evolution factor, h(z), varies by 
30% between z — and z = 0.5 (56% at z = 0.8). To distinguish between various models, 
statistical and systematic errors have to be well below these figures. This requires the use 
of large unbiased cluster samples, covering a wide mass range, in both the local Universe 
and at high redshift. Biased results on the evolution could be obtained, for instance, by 
comparing the Lx~T relations at various z with non representative sub-samples of cooling 
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flow clusters (or by using not fully consistent treatment of the cooling center). Ideally, 
one also wants to compare data obtained with the same instrument, in order to minimize 
systematic uncertainties. For instance, a calibration error of 10% in the kT measurements 
is equivalent to a 30% systematic error on the luminosity (Lboi oc T 3 ). Such an error 
could introduce a bias equivalent to the expected evolution. Ongoing XMM-Newton and 
Chandra large projects, aiming at studying the properties of large unbiased distant and 
local samples, will fulfill the above requirements. Significant progress on the evolution 
of cluster properties are expected from these projects. 

5. Constraining cosmological parameters with X-ray observations of clusters 

Several independent methods can be used to constrain the cosmological parameters 
from cluster X-ray observations. This includes the baryon fraction in clusters, the cluster 
abundance and its evolution, and the cluster spatial distribution. All methods rely in 
principle or in practice on specific assumptions on the scaling and structural properties 
of clusters. 

5T. The baryon fraction. - In the simplest model of cluster formation, the contents 
of clusters is a fair sample of the Universe as a whole (Sec. 14 "21 . The baryon mass 
fraction in clusters is then /b = fib/fi m , where 51b and fi m are the mean baryon density 
and the total matter density of the Universe, /b is the sum of the gas and galaxy mass 
fractions: /b = / ga s + /gai- Combined with the fib value estimated from Big Bang nucleo- 
synthesis or CMB measurements, /b in clusters can be used to measure fi m |133| . The 
method requires an independent knowledge of the Hubble constant h, which enters in the 
determination of fib (fib oc h~ 2 ), of / gas (/ gas oc ft, -3 / 2 ) and of / ga j (/ ga j cx h^ 1 ). Note 
that the baryonic mass in rich clusters is dominated by the X-ray gas. The gas mass 
fraction alone sets an upper limit on fi m . 

Cluster sample studies that rely on measurements of both / gas and / ga i are rare |134| . 
fi m is most often constrained from / gas only, assuming a constant / ga i// gas ratio, taken 
from other cluster studies [§7| 1135) . A further difficulty is that / gas increases with the 
integration radius (Sec. I4'4.2|) and numerical simulations indicate that / gas within the 
virial radius is slightly smaller than the Universe's value |70) . Observed values must be 
corrected for these effects. The correction is about 20% for /b values estimated within 
1/3 of the virial radius |135| . Corrections factors are deduced from adiabatic numerical 
simulations |1341 1135| and may not be perfectly adequate since these simulations do not 
include galaxy formation and fail to account for the observed ICM properties (Sec l4'4ll . 
One also observes a significant increase of /b with system mass, mainly resulting from 
the increase of / gas 1134j . This increase of / gas is likely due to non-gravitational 
processes, less important in high mass systems fSec !4'4|) . Therefore, most studies are 
restricted to massive clusters to minimize systematic errors. To firmly establish which 
cluster populations are fair samples of the Universe, the variation of / gas with system 
mass must be fully understood, which remains to be done. 

Present /b data provides a tight constraint on fi m . All recent studies favor a low 
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Fig. 25. - The X-ray gas mass fraction (la Fig. 26. - The 1, 2 and 3cr confidence con- 
errors) versus z for a ACDM cosmology with straints in the 0.^,^1 a plane obtained from 
fim = 0.25, Oa = 0.96. In this 'best fit' cos- the analysis of the cluster / gas data. Also 
mology / gas is constant. Figure from |135| . shown are the independent results obtained 

from CMB and SNI data. Figure from 



f2 m Universe and are in excellent agreement, e.g. £l m = 0.37 ± 0.08 from Beppo-SAX 
data [§7|, n m = 0.28 ± 0.03 from ROSAT/ASCA data [Hfl and fi m = 0.30 ± 0.04 from 
Chandra data |98| . The variation of ft, with mass and radius, mentioned above, are the 
main source of systematic uncertainties for high precision cosmology. 

5'2. The gas fraction as distance indicator. - Assuming again that / gas is universal 
and thus constant with z, f gas (z) can be used as distance indicator 136 . The values 
derived from observations depend on the angular distance as / gas oc (Da(z)) 3 / 2 . It will 
be constant only for the correct underlying cosmology f Fig. 125( 1 ( 6 ). Because the variation 
of Da with redshift is controlled by the expansion rate of the Universe, H(z), f gas (z) 
thus provides constraints on f2 m and Cl\ for a A CDM cosmology, (or, more generally, 
on the Dark Energy present density and equation of state). Note, however, that there is 
a large degeneracy between the constraints on fi m and Q\. 

Because it is a pure geometrical test, the method is straightforward. However, it 
requires high quality data (precise mass estimates) on a sample of clusters at various 
rcdshifts and it is not free of possible systematical errors. / gas varies with radius and 
mass at z — 0. Therefore, to compare the gas fraction at various z, one estimates / gas 
at the same fraction of the virial radius and for clusters of similar mass (preferentially 



( 6 ) More generally, non evolving cluster properties can be used as distance indicators. This 
includes the gas mass fraction, but also the isophotal radius |137| . or properties corrected for 
evolution, like the scaled emission measure profiles |127| 
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massive clusters) [53 [52]. This is sufficient to avoid biases only if cluster mass profiles 
are similar at all z and if / gas at a given mass does not evolve with z. These assumptions 
must be verified from detailed observations at high z (in particular observations of the 
structural properties of clusters), compared with numerical simulations. 

The principle of this test is similar to that of the cosmological test using SNI as 
'standard' candles [1381 11391 114UI and references therein]. Therefore, it probes the same 
domain of cosmological parameters as SNI experiments. However, there are also system- 
atic uncertainties inherent to the use of SNI (dust extinction, possible evolution of the 
luminosity). Much safer constraints can thus be obtained by cross-checking the results 
obtained with SNI and with the gas fraction. The constraints provided by f gas (z) (and 
SNI) complement the observations of the CMB anisotropics [1411 for WMAP results], 
which do not probe the same domain of cosmological parameters. 

The recent results obtained from Chandra observations are very encouraging |135) 
(see also [5S1E3)- The whole information provided by / gas was used: absolute value and 
apparent evolution with z. For a ACDM cosmology, this yields: £! m = 0.245 ± 0.04 and 
il A = 0.96 ± 0.2 (68% confidence level), with n h h 2 = 0.0214 ± 0.02 and h = 0.72 ± 0.08 
as only priors (Fig. I26|) . CI a is positive at the 3a level. The complementarity with 
CMB observations is obvious in Fig. [213 Note also the complementarity of / gas and 
SNI measurements: the (f2 m , Cl\) degeneracy intrinsic to pure distance measurement 
techniques (SNI or / gas (z) observations) has been broken because the local / gas value 
provides independent constraints on f2 m . On the other hand, the constraints from / gas (z) 
on f?A at a given O m value are still less stringent than those provided by SNIs fFig. I26fl . 
This can be improved using larger samples or more precise estimates of / gas - Finally 
the consistency between/ gas , SNI and CMB constraints is reassuring: their respective 
confidence contours in the £! m , fljy plane intersect in a common domain. 

5'3. Cosmological parameters from cluster abundance and evolution. - An independent 
and powerful method to constrain the cosmological parameters is to measure the growth 
rate of linear density perturbations, as reflected in the evolution of the galaxy cluster 
mass distribution function, n(M, z), i.e., the comoving number density of clusters of mass 
M at redshift z. This cosmological test is not as direct as the previous one, since the 
mass function also depends on the spectrum of the initial density fluctuations. 

From large N-body simulations |142j we can now accurately trace the formation of 
dark matter halos. Such simulations have been used to establish universal parametric 
formulae for the cluster mass function in various cosmological models |143l I144j , improv- 
ing over the original formula provided by |145j . The mass function depends mostly on 
f2 m and as (i.e. the normalisation of the fluctuation power spectrum P(k)), but also on 
h, Ob, Oa and n (the shape of P(k)). The evolution of the mass function, specially of 
the number of high mass clusters, is extremely sensitive to O m j!46l 1147] . as illustrated 
Fig. El (see also Fig. 6 in |148| 1. It was first proposed in |149| to use the evolution of 
X-ray cluster abundance to constrain fi m . 

In practice, this method requires to build large, well controlled, samples of local and 
distant clusters. Furthermore, the mass can never be directly determined and other (s) 
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Fig. 27. - Predicted evolution of the cluster 
mass function. The comoving number density 
of clusters with mass larger than M, is plot- 
ted as a function of M, for three cosmological 
models: fi m = 1. (solid lines), an open model 
with Q m — 0.3 (dotted lines) and a flat model 
with Q m = 0.3,0a = 0.7 (dashed lines). In 
each case, the predictions for z = 0.5, 0.33, 
are shown. Figure from |147) . 



observable(s) have to be used as surrogates. I review below the available data and current 
constraints on the cosmological parameters. For more detailed reviews, the reader may 
refer to [EOI EH. 

5*3.1. The X-ray cluster surveys. Beyond the galactic plane, the X-ray sky is essen- 
tially populated with AGNs (about 90% of the sources at the typical ROSAT detection 
limit) and galaxy clusters. There is a diffuse X-ray background emission, due mostly to 
the local galactic emission (in the soft energy band used normally to detect clusters). 

Detecting clusters in X-ray has several advantages |15()j . First, the detection of X-rays 
is an unambiguous signature of the presence of a true potential well. Second, clusters 
are high contrast objects in the X-ray sky (the X-ray flux depends on the square of the 
density) and can be distinguished from other sources (AGNs) by their spatial extent (since 
ROSAT pointed observations). By contrast, the field galaxy population sets a serious 
problem to optical surveys, as it rapidly overwhelms the galaxy overdensity associated 
with clusters as z increases. X-ray surveys are also less subject to projection effects. 
Third, quantitative selection criteria can be defined: one searches for extended sources 
above a given flux limit. X-ray cluster samples are thus flux-limited samples. This allows 
for precise estimates of the survey volume and thus of space density of clusters. 

However, there are several difficulties in constructing well controlled samples of X-ray 
clusters. In particular, it requires intensive optical follow-up to confirm the identification 
and to measure redshifts. Another critical point is to precisely estimate the 'selection 
function', that is, how the survey samples the true population of clusters. This includes 
the computation of the sky coverage £l(Sx)'- the effective area covered by the survey as a 
function of X-ray flux. This is not trivial because the exposure time, background, PSF, 
efficiency etc.. vary across the telescope field of view and from pointing to pointing and 
so does the detection probability. The detection probability also depends on the cluster 
morphology. In view of these difficulties, one usually relics on Monte-Carlo simulations 
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Fig. 28. - Solid angles and flux limits of various 
X-ray cluster surveys (references in text and in 
|15Up . Dark filled circles: serendipitous sur- 
veys constructed from a collection of pointed 
observations. Light shaded circles: surveys 
covering contiguous areas. Figure from |15U| . 



Fig. 29. - X-ray luminosity and redshift 
distribution of the 160SD EMSS PU 

and BCS 154 cluster samples. The dot- 
ted curves (left to right) are indicative flux 
limit of the surveys: 2.710" 12 , 1.5KT 13 , 
3.1(T 14 ergs/cm 2 /s in the (0.5 - 2) keV en- 
ergy band. Figure from |164| 



to study the survey selection function [1521 I153| . 

Several cluster surveys have been conducted since the early HEAO-1 survey (see |15U] 
for a complete list and descriptions of all X-ray cluster surveys). The RASS (Rosat 
All Sky Survey) was the first (and still unique) X-ray imaging Survey of the whole sky. 
Several cluster catalogs were or are still being constructed from the RASS. Published 
catalogs of X-ray selected clusters include the BCS |154) . the NORAS |155| and the 
REFLEX catalogs [156| . More than 1000 clusters have been discovered, mostly at low 
redshift. For instance, the REFLEX cluster catalog contains ~ 450 clusters out to 
z ~ 0.3. Serendipitous search of clusters is performed using archival pointed observations: 
first with the EINSTEIN satellite (EMSS survey HHZj), then with ROSAT, and now with 
XMM [T55| and Chandra. The EMSS cluster catalog [T531 and references therein] is still 
largely used for cosmological studies and several ROSAT serendipitous cluster catalogs 
are now available in the literature: the Bright SHARC [HO], the WARPS-I [HI], the 
Southern SHARC |162| and the 160SD |163| catalogs. The sky coverage and flux limit 
of various surveys are given in Fig. |2HI Compared to the RASS, ROSAT serendipitous 
surveys go much deeper and thus cover a larger z range (up to z ~ 1.3). However, their 
sky coverage is much smaller, so that serendipitous cluster catalogs include from 12 to 
100 clusters. 

The mass and redshift coverages depend on the survey area and flux limit in a com- 
plex way. The lower luminosity limit (and thus mass limit) increases with z (Fig. EH, 
since X-ray surveys are flux-limited, at variance with SZ surveys. The abundance of 
clusters rapidly decreases with luminosity (Fig. I30|) . To detect massive (and thus rare) 
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Fig. 30. - Compilation of high-redshift XLFs 
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limited surveys (Einstein de Sitter universe), 
compared with the local XLF (shaded area). 
Figure from |lti4| . 
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Fig. 31. - Cumulative temperature function 
(grey solid line with its 68% error band) es- 
tablished from the HIFLUCGS cluster sample 
|165| . Previous results from |91| and |167| are 
shown by the solid and dotted black lines. Fig- 
ure from |165| . 



clusters, the sky coverage must be wide. Going deeper (decreasing the flux limit of a 
survey) extends the low mass coverage. However, this does not necessarily augment the 
z coverage. Indeed, the survey area may become too small to detect clusters above the 
luminosity limit at high z. These points are illustrated Fig. 1291 

Cluster catalogs derived from X-ray surveys contain only basic information: X-ray 
luminosity and redshift. To measure the temperature (and possibly the mass) X-ray 
follow-up is needed, usually with the next generation satellite(s). This may change with 
XMM serendipitous surveys, where we expect that a fraction of the detected clusters will 
be bright enough to allow for the measurement of temperature |158| . 

5'3.2. Measures of cluster space density. The X-ray Luminosity Function (XLF) in 
various z bins, is a direct product of X-ray cluster surveys. Estimating the selection func- 
tion in terms of luminosity is simple: one has n(Lx, z) = fl(Sx) where Sx = L>x/ D\(z) 2 . 
There is an excellent agreement between the local XLFs derived from various surveys 
|150| . and between the various XLFs at high redshifts derived from ROSAT Surveys 
|164j , as shown Fig. [3U| A significant evolution of the XLF is found only at the bright 
end (Lx <; 5 1 44 ergs/s) and above z ~ 0.5 |164| . 

The drawback of using the XLF for cosmology is that the luminosity is very sensitive 
to the detailed gas properties: density distribution in the core, thermodynamic history, 
dynamical state. The XLF is thus not easily related to the mass function. The most 
common method is to combine the empirical Lx~T and the M-T relations. The scatter 
of the Lx~T relation and the uncertainty on its evolution are the major sources of 
systematic uncertainties. 

In principle, the temperature is more directly related to the mass than the luminosity, 
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and the X-ray temperature function (XTF) is a better substitute for the mass function. 
However, because the XTF is derived from cluster samples selected according to their 
flux, a knowledge of the Ly^-T relation (and its scatter) is still required to compute the 
temperature selection function from Q(Sx)- Furthermore, the XTF has generally a lower 
statistical quality and mass coverage than the XLF, because the temperature is only 
measured for sub-samples of available X-ray cluster catalogs (specially at low mass and 
high z). 

The agreement among various local XTFs is less good than for the local XLFs. This 
is illustrated in Fig. The XTF recently derived EE] from the HIFLUGCS cluster 
sample |166| . which contains the 63 brightest clusters from the RASS, is compared to 
earlier XTFs jST] (30 clusters) and QI7| (25 clusters). The XTFs agree within 20% in 
the [3 — 6] keV range but differ by as much as a factor 3 above ^6 — 7 keV (see also 
discussion in |168| and their Fig 4 and 5). The only presently available XTF at high z 
(Fig. EH) is derived from ASCA follow-up of the EMSS Survey (T55| . 

The HIFLUGCS cluster sample is currently the largest local cluster sample that in- 
corporates temperature measurements and also imagery data (from ROSAT pointed ob- 
servations). Using these data, the first (and still unique) cluster mass function, XMF, for 
X-ray selected clusters was established |166j . It can be directly compared to the theoret- 
ical predictions, but this test is not free of possible systematic errors. The masses had 
to be estimated using a simple isothermal /3-model for the gas distribution (and the HE 
equation). Moreover, the determination of the selection function is not straightforward: 
the Lx~M relation (and its scatter) must be known to deduce the mass selection function 
from fi(Sx)- This relation was estimated using an extended sample of 106 clusters. 

Constraining cosmological parameters from the XTF or the XLF requires a good 
calibration of the M-T and/or Ly^-M relations, both at low and high z, This is difficult 
to achieve, because it requires precise total mass measurements. To avoid this problem, 
it was recently proposed to use the baryon mass Mb, which is easy to measure, as a proxy 
of the total mass |169l I170| . The Baryon mass function BMF is directly related to the 
theoretical mass function under the assumption that the baryon fraction in clusters is 
universal and equal to f2b/fi m . In practice, the small variation with mass is taken into 
account. The L-y-M^ and its evolution must still be known to compute the selection 
function in terms of Mb, but this is relatively easily measured. The local BMF, derived 
from a sub-sample of 52 clusters from the HIFLUCGS sample, is plotted in Fig. 
together with the high redshift BMF derived from Chandra follow-up of bright 160SD 
clusters [15511170] . 

5'3.3. Constraints from local abundances. Recent studies of the local distribution 
functions PH EH EH EH El El El indicate that cr 8 £l m - 4 " - 6 = 0.4 - 0.6, or 
erg ~ 0.8 for f2 m = 0.3 |150j . For instance, Fig. 1321 shows the constraint derived from the 
Reflex XLF |174j , using the Ly-M relation from |166j . Note the well-known degeneracy 
between eg and fi m (see |150| for explanation). Constraints from the XTF and the XLF 
from RASS data are consistent, but the XLF gives stricter constraints due to the wider 
mass coverage |173| . 
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Fig. 32. - Likelihood contours (l-3a levels) in 
the o~g-f2 m plane obtained from the REFLEX 
cluster abundances. Note the degeneracy be- 
tween ag and Q m . Figure from |174| 



Fig. 33. - Same as Fig. 1321 but obtained using 
both cluster abundance and large-scale distri- 
bution (scale is not the same). Note the de- 
generacy breaking. Figure from |174| 



Errors on (<78,f2 m ) are currently dominated by systematic uncertainties. The major 
source of errors on erg (at fixed f2 m ) is the uncertainty on the normalization of the M—T 
relation [1731 11751 1159| . If one assumes a higher mass for a given temperature, the 
amplitude of the mass function corresponding to an observed XTF is higher and a higher 
us value is obtained. The discrepancies amoung various studies on as are largely due 
to the different normalizations used 159 . The use of different cosmological priors and 
statistical methods also contributes to the observed differences |173| . Furthermore, a 
precise knowledge and proper treatment of the intrinsic scatter of the scaling relations 
(M-T and Lx~T) is critical. For instance, neglecting the scatter in the Lx~M relation 
biases as towards high values |173| . 

5'3.4. Breaking the degeneracy using local cluster clustering. Galaxy clusters can 
also be used to trace the large-scale structure of the Universe, which depends on the cos- 
mology and the initial density spectrum. The spatial distribution of clusters thus provide 
cosmological constraints, complementary to those obtained from cluster abundance. This 
requires to survey large contiguous regions of the sky. 

The large-scale clustering and abundance of clusters was measured with unprece- 
dented accuracy with the REFLEX survey. Recently, these data were analyzed simulta- 
neously |174| . As shown in Fig. 123 this largely breaks the degeneracy between as and fl m 
observed when only the local XLF is used. The REFLEX sample gives r2 m = 0.34 ±0.03 
and as = 0.71 ± 0.04 (la statistical errors). 

5'3.5. Constraints from evolution . As mentioned above, the evolution of the mass 
function is very sensitive to Q m . The degeneracy between as and f2 m , obtained when 
using local cluster abundance, can be broken if the cluster distribution functions (XLF, 
XTF..) at higher redshift is known. 

Till recently, cosmological tests based on the evolution of the XLF gave ambiguous 
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Fig. 34. - Cumulative temperature function 
established by |159| at low (open circle) and 
high (filled circles) redshift. The high redshift 
XTF is derived from ASCA follow-up of EMSS 
cluster sample. The lines are the best fit mod- 
els (fi m = 0.28, Q,a = 0.94). Figure from [153) 
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Fig. 35. - Constraints on ag and fi m (1 and 2a 
confidence contours) derived from the temper- 
ature function and its evolution, compared to 
constraints from CMB and weak lensing ob- 
servations. Figure from |159| 



results due to the large uncertainty on the evolution of the L^-T relation |150| . This 
evolution is better constrained with XMM-Newton and Chandra. However, a consensus 
has not yet been reached. An analysis of the X-ray luminosity distribution in the RDCS 
sample (103 clusters out toz~ 0.85) gives Q m = 0.35 j;^ 3 and a s = 0.66 ± 0.06 |T7B| . 
A SCDM Universe (f2 m = 1) is excluded with a high confidence level: Q m = 0.1 — 0.6 
at the 3cr level when systematic uncertainties are taken into account. In this study, the 
normalisation of the Lx~T relation was assumed to vary as (1 + z) A , with A = - 1 
in agreement with Chandra observations. On the other hand, a study of the evolution 
of cluster number counts |177| . as measured in various surveys, using as local reference 
the local XTF |168| . gives fl m = 0.85 — 1. As in the previous study |176| . a standard 
evolution of the M—T relation was assumed, and a similar evolution of the Lx~T relation 
was used as constrained from XMM-Newton and Chandra data. 

In contrast, concordant constraints are obtained from the evolution of the XTF |159l 
see Fig or the BMF [1691 see Fig 137] . consistent with the value derived from the 
XLF RDCS study \T7Q. For instance, the BMF evolution 169 gives Q m = 0.24 ± 0.12 
for a flat Universe (68% confidence level). The XTF evolution |159) gives a best fit 
Q m = 0.33 value. It was also used to constrain the equation of state of the Dark Energy: 
w = -0.42 ± 0.21 (68% confidence level). 

5'3.6. Conclusion. Clusters of galaxies are powerful cosmological tools. Several 
independent methods can be used to constrain the cosmological parameters from cluster 
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Fig. 36. - Baryon mass function for thel60SD Fig. 37. - Comparison of constraints derived 

survey sample in the redshift interval 0.4 < from the baryon mass function (Fig. 1361 with 

z < 0.8 (filled circles) assuming Q m = 0.3 and those from distant supernovae and the CMB 

Q.A = 0.7 |169) . The local mass function from (68% confidence regions). Figure from |169| 
; 1 701 is shown by triangles. Figure from |169| 



X-ray (or S-Z) observations. They are complementary to constraints provided by CMB, 
SNI and weak lensing observations. Nearly all studies indicate a low £l m universe, a& and 
f2 m & re typically determined with ±20% accuracy, whereas J7a (or the equation of state 
of the Dark Energy) is not yet well constrained. High precision cosmology with clusters 
is a priori possible. However, the key issue is to control and decrease the systematic 
uncertainties due to our imperfect knowledge of the physics that govern cluster formation 
and evolution. This includes a better knowledge of the intrinsic scatter and evolution 
of the scaling laws. A better precision of the abundance of massive clusters at high z 
should also constrain much more tightly and This requires a survey with a very 
large sky coverage. 

6. Perspectives 

The XMM-Newton and Chandra observatories are designed to operate until at least 
2010. In the coming years, large efforts will be devoted on statistical analysis of known 
cluster samples, using archival data or large projects. Several Large Projects are in 
progress, to follow-up unbiased samples of local or distant clusters discovered by ROSAT. 
In parallel, the XMM-Newton (and to a lesser extent Chandra) satellites will provide 
new cluster samples for cosmology, extending to lower mass and possibly higher z. This 
includes serendipitous surveys, like the XCS [2E|, or contiguous surveys, like the XMM- 

LSS UZHj. 

For understanding the physics of the intra-cluster medium, key information is still 
missing. We cannot measure the velocity structure of the gas. Moreover, the high 
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resolution and spatially resolved spectroscopic data required to investigate the complex 
cluster core is not yet available. This information can only be provided by bolometer 
array type instruments, as will be on board Astro-E2, planned to be launched in a few 
months [175] . 

Understanding non thermal processes in clusters (e.g electron acceleration by shocks 
or turbulence during merger events, effect on the cluster evolution) is another open 
issue. Astro-E2 will greatly improve our capability to observe high energy emission from 
clusters. Further progresses will require spatially resolved spectroscopy at high energies 
(up to E ~ 80 keV). When combined with radio data, this will allow us to map both 
the magnetic field and the non thermal particle population. Several projects are under 
study, like SIMBOL-X Q2D] and NeXT [T75| . 

Planck (to be launched in 2007) should detect about 10000 clusters via the SZ effect. 
The statistical study of this sample, unique by its size, depth and sky coverage, will 
constrain cosmological parameters and provide information on the physics of structure 
formation, ft will be extremely useful (and sometimes mandatory) to combine the SZ 
data with X-ray data, like those obtained by XMM-Newton. Planck and XMM-Newton 
surveys have not the same sky, mass and redshift coverage and it is unrealistic to think of 
a complete X-ray follow-up of the Planck sample. However, sub-samples can still be used, 
to calibrate for instance the Y—M relation. This relation must be known to constrain 
cosmology on the basis of SZ cluster abundances. 

The next generation of X-ray observatories, Constellation-X and XEUS, is already on 
the horizon for the years 2010+ . They should allow us to probe the hot Universe at much 
higher z than presently. We will study early Black Holes, groups of galaxies at z ~ 2 and 
their evolution to the massive clusters of today, and investigate nucleosynthesis down to 
the present epoch. These satellites will perfectly complement observatories like ALMA, 
JWST that will look at the 'cool' component of the Universe. 

* * * 
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